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We present a variant of the Minority Game in which players who were successful in the previous 
timestep stay with their decision, while the losers change their decision with a probability p. Ana- 
lytical results for different regimes of p and the number of players A'^ are given and connections to 
existing models are discussed. It is shown that for p oc 1/N the average loss is of the order of 1 
and does not increase with as for other known strategies. 
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I. INTRODUCTION 

Game theory decribes situations in which players must 
make decisions, i.e. choose between different alternatives, 
and receive payoffs according to their and the other play- 
ers' choices. The question how players decide on a strat- 
egy, i.e. how they find out what to do if they do not pos- 
sess full information on the strategies of the other players, 
was addressed in Ref. There it was suggested that 
each player has a number of models that prescribe an 
action for a given state of the player's world, for exam- 
ple, for a given game history. The model that has proven 
most successful so far is actually used by the player. 

This approach was applied in the Minority Game in- 
troduced and studied in (|-^]. The rules for this game 
and its variations are as follows: 

• There is an odd number A^ of players. 

• At each time step t each player i makes a deci- 
sion ai{t) G { + 1,-1}, the majority is determined, 

S[t) — sign ^^^^ (Ti(t)^ , and those players who 

are in the minority, ai{t) — —S(t), win, the others 
lose. 

• A measure of global loss is 




(1) 



Random guessing leads to cr^ 



N . 



• The only information accessible to players is the 
history of the majority {S{t - M), . . . , S{t)). In 
many cases, the history can be replaced by a ran- 
dom sequence without essentially affecting the re- 
sults |§. 

• Accordingly, no contracts between players are al- 
lowed. 

In the original Minority Game, each player has a small 
number of randomly picked decision tables that prescribe 



an action for each possible history. Those tables receive 
points according to how well they have predicted the best 
action in the course of the game, and the best table is 
used to actually make the decision. 

Other publications studied variants in which the agents 
used neural networks to make their decisions ||^, or 
in which each agent has a probability that determines 
whether he chooses the action that was successful in the 
last step or its opposite 



II. THE MODEL 

In this paper, we introduce a very simple prescription 
for the agents that still is a reasonable way of behaving 
in the absence of detailed information. It is in some ways 
related to Johnson's model 1^,^ , but different in decisive 
details. The model is this: 

• If an agent i is successful in a given turn, he will 
make the same decision the next turn: ai{t + 1) = 
ai{t) . After all, there's no reason to change any- 
thing. 

• Otherwise, the agent will change his output with a 
probability p : prob {ai{t + I) = —ai{t)) = p. The 
agent is reluctant to give up his position, but even- 
tually, something must change. 

This is evidently a stochastic one-step process and can 
be handled well with the tools for Markov processes. We 
therefore introduce variables to describe an ensemble of 
games. 

Instead of using the whole set {(Ti{t)}^i of time depen- 
dent random variables we consider the stochastic process 



Kit) 
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The possible values k that K{t) can take are half- integer 
and run from —N/2 to N/2 in steps of 1. Then, the 
probabilities 

TTkit) = prob(A'(t) = k) 



1 



together with the transition probabihties 

Wki = prob{K{t + 1) = fc I K{t) = t) 

are the basic quantities to describe the system. 
To shorten notation we consider the probabihties 
7rfc(i) as components of the state vector Tr{t) = 
(7r_7v/2(i), • • ■ 1 i'Af/2(^))"^- The number of players in the 
majority at time t is N/2 + |if Since the individual 
players perform independent Bernoulli trials, the tran- 
sition probability Wu = W{£ k) from a state with 
K{t) — £ to Kit + 1) = fc is given by the binomial distri- 
bution 



JV _ «\ 

- ' j/-^(l-p)f-fc for£<0. (3) 



It is understood that = for to < 0. 

This stochastic process may be considered a random 
walk in one dimension, where steps of arbitrary size with 
probability are allowed only in the direction of the 
origin. 

Given the initial state 7r(0), the state Tv{t) is updated 
at each time step by multiplying it by the transition ma- 
trix W: 



ir{t + l) = W7r(i), 



(4) 



The mathematical theory dealing with this kind of prob- 
lems is that of Markov chains with stationary transition 
probabilities Since (W^)fc^ > 0, the chain is irre- 
ducible as well as ergodic which implies that ir- 
respective of the initial distribution the state 7r(t) con- 
verges for t oo to a unique stationary state 7r(oo) = 
TT*. In view of Eq. (jij) tt* corresponds to an eigenvector 
of W with eigenvalue 1 : 



W7r" = 7r" and ^7r^ = l. 



The properties of this eigenvector, which by the stated 
normalization condition becomes unique, are our main 
interest. 

The problem can be simplified by exploiting the sym- 
metry W^k,-i ~ Wki, which implies the symmetry 
ttIj, = 7r| of the stationary state. Reformulating the 
eigenvalue problem for the independent components of 
tt", the eigenvector can be calculated numerically up to 
N K, 1200 in reasonable time with standard linear alge- 
bra packages. 



We will first consider p = x/{N/2), where x is constant 
and much smaller than N. As N is increased, the number 
of players that switch sides every turn stays constant to 
first order: since the majority is approximately N/2, on 
the average x agents will change their opinion. 

In this case the matrix elements Wke can be approxi- 
mated by Poisson probabilities M : 



Wki ^ Wi', = e'^ 



{l-k)\ 
{k-l)\ 



for ^ > 0, 



for ^ < 0, 



(6) 



where, again, 1/to! for negative m has to be interpreted 
as zero. In the limit N ^ oo we are thus looking for an 
infinite component vector tt** satisfying the eigenvalue 
equation together with the proper normalization: 



and Hk — 1. 



(7) 



Making use of and (|^) we were able to derive equa- 
tions for the moments of the stationary distribution: 
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etc. 



These in turn determine the characteristic function of tt 
and a Fourier transform finally leads to 



1 



(-1). 
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It has been proven that (^) indeed satisfies the eigenvalue 
(5) equation Note that 7r| can be expressed by the 



incomplete gamma function: 



7( 



2 ' 



2a;(|fc| 
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(10) 



A comparison with numerically determined eigenvectors 
of the matrix (ph for = 801 gives excellent agreement, 
as seen in Fig7|l|. The distribution is roughly flat for 
small |fc|, has a turning point near |fc| = a; and falls off 
exponentially with fc for larger values of |fc|. From (||), 
the variance — <^(2fc)^) can be calculated: 



= 1 + Ax- 



(11) 



III. SOLUTION FOR SMALL PROBABILITIES 

A closer look reveals that as A^ ^ cx3, there are two 
scaling regimes for cr^, depending on how p depends on A^. 



For small x, this approaches the optimal value cr^ = 1 
that occurs if the majority is always as narrow as pos- 
sible, but even for larger x, cr^ does not increase with 
A^. 
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FIG. 1. Stationary solution tt^ for p — 2x/N . The numeri- 
cal solution for TV = 801 (symbols) is in good agreement with 
the analytical solution for N ^ oo. 



Numerical calculations show that the eigenvector tt** (k) 
takes the shape of two Gaussian peaks centered at sym- 
metrical distances ±ko from the origin (see Fig. ||). 

The physical interpretation is that the majority 
switches from one side to the other in every time step. 
Since approximately (ko + 1/2)p agents switch sides ev- 
ery turn and the distance between the two peaks amounts 
to a number of 2 kq TV agents, we get kq = p/(4 — 2p). 

This reasoning can be made more precise, and also the 
width of the peaks for large but finite N can be calcu- 
lated by the following argument: The well known normal 
approximation for the binomial coefficients in (|^) leads 
to 



W{ti,\)^NWk 



1 
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exp 



1(^-/(A))^ 
2 



.2(A) 



where /(A) — {I — p)\ — sign(A) 



P 



and s^(A) = 



p(l-p)(i + |A|) 



N 



(13) 



A double gaussian of the form 



IV. SOLUTION FOR LARGE PROBABILITIES 

The other scaling regime assumes that p is of or- 
der one and piV ^ 1. To handle this regime, we 
will use a rescaled (continuous) coordinate k = k/N = 
o-j/(2TV), the range of which is -1/2 < k < 1/2. Mul- 
tiplied by N , the stationary state tt^ for large N turns 
into a probability density function 7r''(K), and the ma- 
trix Wm becomes an integral kernel W{k,\), hence (||) 
is transformed into an integral equation: 

tt^k)^ J W{K,X)TT''{X)dX and jTr'{K)dK = l. 

(12) 
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is transformed by the integral kernel (|l3| ) into a double 
peak of the same type if in the integral equation we ap- 
proximate the variance s'^{X) of ( |l3| ) by s'^{±ko) and if 
the assumption <C Kq is justified. It means that the 
peaks are well separated and that the integral can be ex- 
tended from —oo to oo. By requiring 7r*(K) from ([l^ ) to 
satisfy the eigenvalue equation (12) we get 



P 



2(2 -p) 



and b = 



1-p 



{2-pYN' 



(15) 
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FIG. 2. Stationary solution 7r°(K) for p = 0.4. With in- 
creasing N , the width of the peaks becomes narrower. 



The result for kq confirms the simple argument given 
above, whereas the term for b^ is slightly surprising: it 
does not depend on p in the leading order, i.e. it is not 
simply the number of players who switch sides. Eq. ( [Tsl ) 
also allows to check whether the assumptions made for 
its derivation are true for a given p and N . For exarnple, 
for p — x/N, K^/b'^ for N ^ oo according to (|l^), 
so one cannot expect the formation of double peaks in 
this limit. The crossover from single-peak to double-peak 
distribution occurs for p oc 1/ \/N. 

It is now easy to integrate over the probability distri- 
bution to get an expression for cr^: 



N 



{2-py 



:(TVp2-H4(l-p)). 



(16) 



This holds well if the condition kq ^ ^ is fulfilled, i.e. for 
sufficiently large p and N, as seen in Fig. ^. 
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FIG. 3. for several values of p and A'^, compared to pre- 
dictions by Eq. (p^. 
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V. CONCLUDING REMARKS 

The presented strategy can be related to the decision 
tables of Challet and Zhang's Minority Game as follows: 
if every player keeps only one decision table with entries 
for all possible histories and changes the entries individ- 
ually with a probablility of p if he loses in a given time 
step, the mean result will we exactly the same as for 
the presented one-step memory. A similar argument was 
given for Johnson's variant in |T2[ . The memory size, 
which determines the number of entries in the tables, is 
completely irrelevant for the average loss of each player, 
but does influence the time series of minority decisions 
generated by the system. 

In summary, we have found an analytic solution of a 
stochastic strategy for the Minority Game. Although this 
strategy is very simple, it yields an average loss of order 
one even in the limit of infinitely many agents. Questions 
that will be discussed in future publications include the 
dynamics and relaxation time of the system, interactions 
with players using other strategies and individual prob- 
abilities for each player. 
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